Linear mixed model selection for false discovery rate control in microarray data analysis.
نویسندگان
چکیده
In a microarray experiment, one experimental design is used to obtain expression measures for all genes. One popular analysis method involves fitting the same linear mixed model for each gene, obtaining gene-specific p-values for tests of interest involving fixed effects, and then choosing a threshold for significance that is intended to control false discovery rate (FDR) at a desired level. When one or more random factors have zero variance components for some genes, the standard practice of fitting the same full linear mixed model for all genes can result in failure to control FDR. We propose a new method that combines results from the fit of full and selected linear mixed models to identify differentially expressed genes and provide FDR control at target levels when the true underlying random effects structure varies across genes.
منابع مشابه
The False Discovery Rate in Simultaneous Fisher and Adjusted Permutation Hypothesis Testing on Microarray Data
Background and Objectives: In recent years, new technologies have led to produce a large amount of data and in the field of biology, microarray technology has also dramatically developed. Meanwhile, the Fisher test is used to compare the control group with two or more experimental groups and also to detect the differentially expressed genes. In this study, the false discovery rate was investiga...
متن کاملModel-Based Approach to FDR Estimation
False Discovery Rate (FDR) has become commonly used in multiple comparisons, in part driven by studies involving large numbers of parameters, e.g. microarray data analysis. FDR is a philosophically different approach from traditional hypothesis testing, and it offers a conceptually consistent framework for multiple hypothesis testing when the primary objective focuses on discovery instead of te...
متن کاملEstimating the False Discovery Rate Using Mixed Normal Distribution for Identifying Differentially Expressed Genes in Microarray Data Analysis
The recent development of DNA microarray technology allows us to measure simultaneously the expression levels of thousands of genes and to identify truly correlated genes with anticancer drug response (differentially expressed genes) from many candidate genes. Significance Analysis of Microarray (SAM) is often used to estimate the false discovery rate (FDR), which is an index for optimizing the...
متن کاملLaplace Approximated EM Microarray Analysis: An Empirical Bayes Approach for Comparative Microarray Experiments
A two-groups mixed-effects model for the comparison of (normalized) microarray data from two treatment groups is considered. Most competing parametric methods that have appeared in the literature are obtained as special cases or by minor modification of the proposed model. Approximate maximum likelihood fitting is accomplished via a fast and scalable algorithm, which we call LEMMA (Laplace appr...
متن کاملDetecting Differentially Expressed Genes Using Calibrated Bayes Factors
A common interest in microarray data analysis is to identify genes having changes in expression values between different biological conditions. The existing methods include using two-sample t-statistics, modified t-statistics (SAM), Bayesian t-statistics (Cyber-T), semiparametric hierarchical Bayesian models, and nonparametric permutation tests. All these methods essentially compare two populat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Biometrics
دوره 66 2 شماره
صفحات -
تاریخ انتشار 2010